Tag
QGram Index Fibres
Tag to select a specific fibre (e.g. table, object, ...) of a q-gram index.
Tags
QGram_Text
The original text the index should be based on.
QGram_RawText
The raw text the index is really based on.
Remarks: The QGram_Text and QGram_RawText fibres are equal by default. They differ if the index text is a set of strings. In that case, the raw text is the concatenation of all strings in the set.
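The relation between the two text fibres can be illustrated without the library. The following is a minimal sketch in plain C++, not SeqAn code: `concatRawText` is a hypothetical helper, and `std::vector<std::string>` merely stands in for a string set.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical helper (not part of SeqAn): builds the "raw text" of a
// string set by concatenating its members in order.
std::string concatRawText(const std::vector<std::string>& textSet)
{
    std::string raw;
    for (const std::string& s : textSet)
        raw += s;
    return raw;
}
```

For a set holding a single string, the result equals that string, which mirrors the default case where both fibres are equal.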
QGram_SA
The suffix array.
Remarks: The suffix array contains the indices of all suffixes of QGram_RawText in lexicographical order.
Fibre returns a String over the alphabet of the SAValue of TIndex.
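To make the definition concrete, here is a minimal sketch that builds such an array by sorting suffix start positions. It uses only the C++ standard library; `buildSuffixArray` is a hypothetical name, and the naive comparison sort is chosen for clarity, not because the library constructs the fibre this way.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <numeric>
#include <string>
#include <vector>

// Hypothetical helper (not part of SeqAn): returns the start positions of
// all suffixes of rawText, ordered so that the suffixes are lexicographically
// ascending. Naive O(n^2 log n) worst case; for illustration only.
std::vector<std::size_t> buildSuffixArray(const std::string& rawText)
{
    std::vector<std::size_t> sa(rawText.size());
    std::iota(sa.begin(), sa.end(), std::size_t{0});  // 0, 1, ..., n-1
    std::sort(sa.begin(), sa.end(),
              [&rawText](std::size_t a, std::size_t b) {
                  // Compare the suffix starting at a with the one starting at b.
                  return rawText.compare(a, std::string::npos,
                                         rawText, b, std::string::npos) < 0;
              });
    return sa;
}
```

For the text "banana" this yields the positions 5, 3, 1, 0, 4, 2, corresponding to the suffixes "a", "ana", "anana", "banana", "na", "nana".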
QGram_Dir
The directory/hash table.
Remarks: The directory contains the start indices of the q-gram buckets. A q-gram bucket is a contiguous interval in the suffix array (QGram_SA). Each suffix in this interval begins with the same q-gram.
Fibre returns a String over the alphabet of a size type.
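The interplay between directory and suffix array can be sketched with a counting sort over q-gram hashes. This is an illustration under stated assumptions, not the library's implementation: the alphabet is fixed to A, C, G, T, the q-gram is ungapped, the directory carries one extra end entry so that bucket h spans [dir[h], dir[h+1]), and positions inside a bucket stay in text order. All names (`ordValue`, `hashQGram`, `QGramTables`, `buildQGramTables`) are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Maps A, C, G, T to 0..3. Any other character is treated like 'T'
// (a simplification for this sketch).
std::size_t ordValue(char c)
{
    switch (c) {
        case 'A': return 0;
        case 'C': return 1;
        case 'G': return 2;
        default:  return 3;
    }
}

// Base-4 hash of the ungapped q-gram starting at position pos.
std::size_t hashQGram(const std::string& text, std::size_t pos, std::size_t q)
{
    std::size_t h = 0;
    for (std::size_t i = 0; i < q; ++i)
        h = h * 4 + ordValue(text[pos + i]);
    return h;
}

struct QGramTables {
    std::vector<std::size_t> dir;  // bucket start offsets into sa, plus one end entry
    std::vector<std::size_t> sa;   // text positions grouped by q-gram hash
};

QGramTables buildQGramTables(const std::string& text, std::size_t q)
{
    assert(q > 0 && q <= text.size());
    std::size_t buckets = 1;
    for (std::size_t i = 0; i < q; ++i)
        buckets *= 4;                              // 4^q possible q-grams
    const std::size_t n = text.size() - q + 1;     // number of q-gram positions

    QGramTables t;
    t.dir.assign(buckets + 1, 0);
    for (std::size_t p = 0; p < n; ++p)            // 1. count each q-gram
        ++t.dir[hashQGram(text, p, q) + 1];
    for (std::size_t h = 1; h <= buckets; ++h)     // 2. prefix sums give bucket starts
        t.dir[h] += t.dir[h - 1];

    std::vector<std::size_t> next(t.dir.begin(), t.dir.end() - 1);
    t.sa.resize(n);
    for (std::size_t p = 0; p < n; ++p)            // 3. place positions into buckets
        t.sa[next[hashQGram(text, p, q)]++] = p;
    return t;
}
```

With the text "ACGACG" and q = 2, the q-gram "AC" hashes to 1, so its occurrences are found in sa[dir[1]] to sa[dir[2] - 1], which are the positions 0 and 3.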
QGram_Shape
The shape the index is based on.
Remarks: The q-gram index needs an underlying Shape. This shape can be gapped or ungapped. The number of '1's (relevant positions) in the shape determines q and the size of the directory table.
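How a gapped shape determines q and the directory size can be sketched as follows. This is plain C++ under assumptions, not the library's Shape class: the shape is written as a string of '1' (relevant) and '0' (ignored) characters, the alphabet is A, C, G, T, and the function names are hypothetical.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <string>

// Maps A, C, G, T to 0..3. Any other character is treated like 'T'
// (a simplification for this sketch).
std::size_t ordValue(char c)
{
    switch (c) {
        case 'A': return 0;
        case 'C': return 1;
        case 'G': return 2;
        default:  return 3;
    }
}

// q is the number of relevant positions ('1's) in the shape.
std::size_t shapeWeight(const std::string& shape)
{
    return static_cast<std::size_t>(std::count(shape.begin(), shape.end(), '1'));
}

// Number of distinct hash values, i.e. q-gram buckets, over a 4-letter alphabet.
std::size_t bucketCount(const std::string& shape)
{
    std::size_t buckets = 1;
    for (std::size_t i = 0; i < shapeWeight(shape); ++i)
        buckets *= 4;
    return buckets;
}

// Base-4 hash of the text under the shape placed at pos;
// characters under a '0' are skipped.
std::size_t hashWithShape(const std::string& text, std::size_t pos,
                          const std::string& shape)
{
    assert(pos + shape.size() <= text.size());
    std::size_t h = 0;
    for (std::size_t i = 0; i < shape.size(); ++i)
        if (shape[i] == '1')
            h = h * 4 + ordValue(text[pos + i]);
    return h;
}
```

Under the shape "1101", the texts "ACGT" and "ACAT" receive the same hash, because the third position is a gap. The shape spans four characters but has weight 3, so it yields 4^3 = 64 buckets.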
Remarks
These tags can be used to get Fibres of a q-gram based Index.
SeqAn - Sequence Analysis Library - www.seqan.de